Picture for Wei Wu

Wei Wu

SEAL: Synergistic Co-Evolution of Agents and Learning Environments

Add code
May 23, 2026
Viaarxiv icon

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Add code
May 20, 2026
Viaarxiv icon

PEARL: Unbiased Percentile Estimation via Contrastive Learning for Industrial-Scale Livestream Recommendation

Add code
May 20, 2026
Viaarxiv icon

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

Add code
May 19, 2026
Viaarxiv icon

Natural Gradient Bayesian Filtering: Geometry-Aware Filter for Dynamical Systems

Add code
May 04, 2026
Viaarxiv icon

Scaling Human-AI Coding Collaboration Requires a Governable Consensus Layer

Add code
Apr 20, 2026
Viaarxiv icon

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Add code
Apr 11, 2026
Viaarxiv icon

Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

Add code
Mar 26, 2026
Viaarxiv icon

Aligning Large Language Models with Searcher Preferences

Add code
Mar 11, 2026
Viaarxiv icon

Turning Semantics into Topology: LLM-Driven Attribute Augmentation for Collaborative Filtering

Add code
Feb 24, 2026
Viaarxiv icon